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Abstract — Because networks can be used to represent many 
complex systems, they have attracted considerable attention in 
physics, computer science, sociology, and many other disciplines. 
One of the most important areas of network science is the 
algorithmic detection of cohesive groups (i.e., "communities") 
of nodes. In this paper, we algorithmically detect communities 
in social networks and image data by optimizing multislice 
modularity. A key advantage of modularity optimization is that 
it does not require prior knowledge of the number or sizes of 
communities, and it is capable of finding network partitions that 
are composed of communities of different sizes. By optimizing 
multislice modularity and subsequently calculating diagnostics 
on the resulting network partitions, it is thereby possible to 
obtain information about network structure across multiple 
system scales. We illustrate this method on data from both social 
networks and images, and we find that optimization of multislice 
modularity performs well on these two tasks without the need 
for extensive problem-specific adaptation. However, improving 
the computational speed of this method remains a challenging 
open problem. 

Index Terms — clustering algorithms; network theory (graphs); 
image segmentation 

I. Methods 

Many networks can be partitioned into communities, such 
that they consist of cohesive (and often dense) groups of 
vertices with sparse connections between distinct groups Q. 
Perhaps the most popular way of detecting communities 
algorithmically is by optimizing the quality function known 
as modularity 0: 
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which measures how well a network can be partitioned into 
disjoint groups of nodes. In |T]), are the elements of 
the graph's adjacency matrix A, the sum of all of the edge 
weights in the network is m, ki is the strength (i.e., weighted 
degree) of node i, and the resolution parameter 7 [ 3 ] enables 
us to uncover community structure at different scales. The 
modularity of a network partition measures the fraction of 
total edge weight within communities minus that expected 
if edges were placed randomly according to the null model 
Pij = kikj/(2m), which preserves a network's expected 
strength distribution. Finding a network partition that attempts 
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Fig. 1. (a) Schematic of a multislice network. (We reproduce this image from 
1 4 1 with permission from the authors.) (b) Image of a pair of cows, which we 
downloaded from the Microsoft Research Cambridge Object Recognition Im- 
age Database 1 10 1 (copyright © 2005 Microsoft Corporation). It is modified 
to produce the segmentation in Fig. [2] 



to maximize Q allows one to probe a network's community 
structure. In contrast to traditional forms of spectral clustering, 
modularity optimization requires no knowledge of the number 
or sizes of communities, and it also allows one to segment a 
network into communities of disparate sizes (even for a fixed 
value of 7) Q, 0. 

Optimization of modularity was recently generalized to 
"multislice" networks (U, which are represented using ad- 
jacency tensors and consist of layers of ordinary networks. 
The framework of multislice networks can thereby be used to 
represent time-dependent or multiplex networks. In Fig. [IJa)> 
we show a schematic of a multislice network. Using this 
framework, we define a generalized modularity function (U 
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where gj r indicates that community assignment of node j 
from slice r, the intraslice edge strength of node j in slice s 
is kj S = J2iAij s , the corresponding interslice edge strength 

is c j3 = J2 r C jsr, and 2/i = V > k j 



In 0, 



one can 



use a different resolution parameter 7 S in each slice. For a 
given slice s, the quantity Aij s gives the edge weight between 
nodes i and j. For a given node j, the quantity Cj sr gives 
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Fig. 2. (a) Segmentation of the cow image in Fig. ^ (b) obtained using 
optimization of multislice modularity with interslice coupling parameter uj = 
0.3. The horizontal axis shows the slice index s £ {1, . . . ,6}, which has 
an associated resolution-parameter value of 7 S = 0.04s — 0.03. The vertical 
axis gives the sorted pixel index. Color in each vertical stripe indicates the 
community assignments of the pixels in the corresponding network slice. We 
also show the segmentation that we obtain in the images for (b) 7 S = 0.05, 
(c) 7 S = 0.13, and (d) 7 S = 0.21. 

the interslice coupling between the rth and sth slices. 

Optimization of the ordinary modularity function ([T} has 
been used to study community structure in myriad networks 
fTl . and it has also been used in the analysis of hyperspectral 
images [ 5 ] recently. In our work, we optimize multislice mod- 
ularity ^ to examine community structure in social networks 
and segmentation of images. In each case, we start with a static 
graph, and each layer of the multislice network uses the same 
adjacency matrix but associates it with a different resolution- 
parameter value 7 S . We include interslice edges between each 
node j in adjacent slices only, so Cj sr = unless \r — s\ = 1. 
We set all nonzero interslice edges to a constant value uj. 
This setup, which was illustrated using the infamous Zachary 
Karate Club network in (H, allows one to detect communities 
using a range of resolution parameter values while enforcing 
some consistency in clustering identical nodes similarly across 
slices. The strength of this enforcement becomes larger as one 
increases uj. To optimize multislice modularity ([2]), we use a 
Louvain-like locally-greedy algorithm l6l. 171. 

II. Data and Results 

A. LAPD Field Interview Data 

In Qj]], we used data with both geographic and social infor- 
mation about stops involving street gang members in the Los 
Angeles Police Department (LAPD) Division of Hollenbeck 
1S1 . We optimized multislice modularity ^ as a means of 
unsupervised clustering of individual gang members without 
prior knowledge of the number of gangs or affiliation of 
the members. We subsequently examined network diagnostics 



over slices to attempt to estimate the number of gangs that is 
stable across multiple resolution-parameter values and that also 
corresponds roughly to the number expected by the LAPD. 

B. Cow Image 

We segment the cow image in Fig. [TJb) (which contains 
about 3 x 10 4 pixels) without specifying the number of image 
components. We build a graph of this image in which each 
node corresponds to a pixel and each edge indicates the 
similarity between a pair of pixels. We associate a 3 x 3 pixel- 
neighbor patch with each pixel i in the image. Let po(hj) 
denote the L2 norm of the difference of patches corresponding 
to nodes i and j. The adjacency matrix A that we use in each 
layer of the multislice network has elements 

where r(i) is the 30th smallest between pixel i and other 
pixels [9 ]. We construct a multislice network that consists of 
six copies of A. We associate the resolution parameter value 
7 S = 0.045 — 0.03 with slice s G {1, . . . , 6}. We then optimize 
multislice modularity and obtain the image segmentations 
shown in Fig. [2] (Color indicates group assignments.) With 
this procedure, we are able to identify all four components of 
the image. As indicated in panel (a), we obtain smaller-scale 
communities (i.e., groups of pixels) as we increase the value 
of the resolution parameter. Importantly (see the discussion 
in Section [I]), the coupling between slices enforces some 
consistency in clustering identical nodes similarly across 
slices. In panels (b) and (c), we observe a good segmentation 
of the two cows, the sky, and the background grass. As 
indicated in panel (d), the three groups corresponding to the 
two cows and the sky stay relatively stable, but the group 
corresponding to the grass breaks down by the sixth slice. 

This application on image segmentation is computationally 
expensive due to the large number of pixels. It takes a lot 
of computational memory and time to run the optimization 
using more slices, which we would like to do in order to 
investigate how the segmentation evolves over a larger range 
of resolution values. Computational improvements will be 
necessary to conduct more detailed analysis. 

III. Future Directions 

As mentioned above, optimization of multislice modularity 
can be computationally expensive. As the size of network data 
has increased tremendously, it is crucial to develop efficient 
algorithms to cluster network nodes to obtain insights on 
applications like social networks and images. To do this, one 
needs to take advantage of data sparsity to help speed up 
optimization processes. Aside from the computational cost, 
how to characterize and analyze the performance of modularity 
optimization is of importance as well. 
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